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Coronaviruses encode large replicase polyproteins which are proteolytically processed by viral proteases to 
generate mature nonstructural proteins (nsps) that form the viral replication complex. Mouse hepatitis virus 
(MHV) replicase products nsp3, nsp4, and nsp6 are predicted to act as membrane anchors during assembly 
of the viral replication complexes. We report the first antibody-mediated Western blot detection of nsp6 from 
MHV-infected cells. The nsp6-specific peptide antiserum detected the replicase intermediate p150 (nsp4 to 
nsp11) and two nsp6 products of approximately 23 and 25 kDa. Analysis of nsp6 transmembrane topology 
revealed six membrane-spanning segments and a conserved hydrophobic domain in the C-terminal cytosolic 


tail. 


Coronaviruses are enveloped, positive-stranded RNA vi- 
ruses that sequester host cell membranes to assemble viral 
replication complexes in the cytoplasm of infected cells (2, 21). 
In the case of murine coronavirus mouse hepatitis virus 
(MHV), three viral proteases process the replicase polypro- 
teins (3, 4, 5, 9, 12, 13, 14, 16, 18, 19, 24, 26) into intermediates 
and 16 mature nonstructural protein (nsp) products (Fig. 1A). 
It is unclear whether the intermediate forms or the mature 
nsps are responsible for assembly of the viral replication com- 
plex. The replicase proteins nsp3, nsp4, and nsp6 contain trans- 
membrane (TM)-spanning sequences that are proposed to be 
important for sequestering endoplasmic reticulum (ER) mem- 
branes to form the double-membrane vesicles which are the site 
of viral RNA synthesis (11, 17). However, the mechanism used by 
the nsps to generate double-membrane vesicles is not yet under- 
stood. Recent reports (8, 15, 22, 23, 28) and the study presented 
here have unraveled the membrane topology of these nsps. nsp4 
is a glycoprotein with four TM domains (8, 22, 23, 28). nsp3 
anchors its 213-kDa multidomain protein to ER membranes, 
likely using two TM domains (15, 22). Recently, nsp6 was shown 
to contain six TM domains (22); however, the authors were un- 
able to resolve which of two C-terminal hydrophobic domains can 
act as the final membrane-spanning region. 

In this report, we show the first antibody-mediated detection 
of MHV-AS59 nsp6 in virally infected cells. We also report the 
TM topology of nsp6, as determined by glycosylation tagging 
and N-linked glycosylation sequence insertion mutagenesis ap- 
proaches, providing evidence that nsp6 contains six membrane- 
spanning segments with a large C-terminal tail exposed to the 
cytosol. Multiple alignment of the nsp6 amino acid sequences 
from each coronavirus group revealed a high level of conser- 
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vation at the C-terminal end, suggesting an evolutionarily con- 
served function. 

To detect nsp6 replicase protein in MHV-AS59-infected cells, 
we used a polyclonal rabbit antiserum directed against a pep- 
tide (PLGVYNYKISVQEL) from the C-terminal region of 
nsp6. We detected the replicase intermediate p150 (nsp4 to 
nsp11) and two nsp6-specific products of 23 and 25 kDa (Fig. 
1B, lane 2) in MHV-infected HeLa-MHVR (25) cells by West- 
ern blot analysis. We found similar mature products of nsp6 in 
MHV-infected murine cell lines 17Cl-1 and DBT (data not 
shown). The same MHV-infected cell lysate was used to detect 
nsp8 replicase protein with a specific antibody that also recog- 
nizes p150 (Fig. 1B, lane 4). The reason for the existence of 
multiple forms of nsp6 is currently unknown, although post- 
translational modification or alternative processing of nsp6 
cannot be ruled out at this point. Future experiments will be 
directed at purification and analysis of the two forms of nsp6 
detected here. 

To develop a framework for understanding the membrane 
topology of nsp6, we first performed nsp6 bioinformatics anal- 
ysis. Five out of the seven bioinformatics tools predicted that 
nsp6 would encode seven TM domains, whereas two programs 
predicted that it would encode eight TM domains (Fig. 2). 
However, because both the N and C termini of nsp6 must be 
processed in the cytosol by the viral 3C-like protease (3CLpro), 
we expected nsp6 to encode an even number of TM domains 
and established a consensus TM domain prediction for nsp6 
(Fig. 2, bottom row). The consensus provided a working model 
for generating plasmid DNA constructs for evaluating the 
membrane topology of MHV nsp6. First, we employed enhanced 
green fluorescent protein (EGFP) glycosylation tagging (EGFP 
glyc) experiments as previously used for determining the mem- 
brane topology of other viral replicase TM proteins (20, 22). This 
approach allowed us to determine the localization of the tagged 
protein based on the differences in the mobility of the endogly- 
cosidase H (endo H)-treated protein versus that of the untreated 
protein by the use of sodium dodecyl sulfate-polyacrylamide gel 
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FIG. 1. Schematic diagram of MHV RNA genome, indicating the proteolytic processing scheme of the replicase polyprotein and Western blot 
detection of MHV nsp6. (A) MHV-AS9 linear RNA genome with the canonical representation of replicase, structural, and accessory genes. The 
replicase polyprotein intermediates and mature nsps generated during processing are depicted. The mature nsp6 replicase protein (hatched box) 
and the antibodies used to detect nsp6 and nsp8 (solid black boxes) are indicated. aa’s, amino acids. (B) Western blot analysis of nsp6. Whole-cell 
lysates were prepared from mock-infected (M) and MHV-infected (1) HeLa-MHVR cells, and the lysates were separated by 12.5% SDS-PAGE. 


Products were detected by probing with nsp6- or nsp8-specific antibodies. 


electrophoresis (SDS-PAGE) analysis. Based on the consensus 
topology model (Fig. 3A) suggesting a maximum of eight TM 
domains, we generated a series of plasmid DNA constructs start- 
ing with the N-terminal putative TM1 domain, and successively 
larger constructs were fused at their carboxyl terminus in frame 
with EGFPglycV5. The plasmid DNAs were individually trans- 
fected into BsrT7 cells (6), and the newly synthesized fusion 
proteins were radiolabeled with 100 Ci of [*°S]methionine-cys- 
teine per ml from 20 to 22 h posttransfection. Chimeric proteins 
expressed from the cell lysates were immunoprecipitated with V5 
antibody, either endo H treated or mock treated, separated using 
12.5% SDS-PAGE, and analyzed by autoradiography as de- 
scribed previously (15). 

We found that fusion protein products expressed from the 
reporter constructs (nsp6-35glycV5, nsp6-86glycV5, and nsp- 


Topology | No. of! taqy | two | tw3 | Twa | rms | Tmo | Tm7 | TM8 
Program TMs 

SOSUI 7 | 7-29 | 37-59 | 65-82 | 111-133]143-165] 178-200] 211-233 
TMHMM | 7_ | 13-35 | 40-59 | 64-86 | 106-128 [135-156] 171-193| 206-228 
MEMSAT | 7 | 13-29 | 41-59 | 66-86 | 112-128]138-154] 162-182] 206-228 
TMPred 7 | 11-29 | 41-59 | 64-85 | 109-128]137-153 ] 163-188] 208-228 
PHOBIUS | 7 | 12-34 | 40-59 | 66-85 | 113-131]138-156] 162-187] 208-232 
CONPredit| 8 | 9-29 | 39-59 | 65-85 | 110-130]134-154 161-181]207-227 |264-284 
HMMToP | 8 | 9-28 | 41-59 | 66-85 | 112-128 135-153] 162-181| 206-228 |265-287 
Consensus | 8 7-35 | 37-59| 64-86 |106-133 |134-165|161-200|206-233/264-287, 


FIG. 2. Summary of TM predictions for MHV nsp6 obtained from 
membrane topology bioinformatics tools. The nsp6 amino acid sequence 
(amino acids 3637 to 3923 in the MHV A59 genome are numbered 1 to 287 
for simplicity) was analyzed for TM-spanning domains by the use of various 
bioinformatics tools, and the residue numbers with predicted TM domains 
are displayed. The consensus TM topology of MHV nsp6 used as a basis for 
the topology experiments is depicted at the bottom row (shaded in gray). 
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FIG. 3. Determining the topology of nsp6 by the use of EGFPglyc and insertion of glycosylation consensus sites. (A) Schematics of a working 
topology model of MHV nsp6 (obtained from our consensus experiments) and nsp6-EGFPglycV5 fusion constructs generated for endo H assay. 
(B) Metabolic labeling and endo H treatment of nsp6-EGFPglycV5 fusion proteins. The nsp6-EGFPglycV5 fusion proteins expressed in 
transfected BsrT7 cells were radiolabeled from 20 to 22 h posttransfection, and then cell lysates were subjected to immunoprecipitation with V5 
antibody, treated with endo H or left untreated, separated by 12.5% SDS-PAGE, and analyzed by autoradiography. (C) Map of plasmid DNA 
construct showing the sites of inserted glycosylation acceptor consensus sequences (NXS). The locations of glycosylation insertion in the nsp6-V5 
construct are represented, with the amino acid number at the site of insertion. (D) Metabolic labeling and endo H analysis of glycosylation sequence 
insertion expression constructs of nsp6-V5. The plasmid DNAs (iNsp6-V5 constructs) were transfected and analyzed as described for panel B. 
(E) MHV-AS9 nsp6 topology model, summarizing the results of EGFPglycVS and glycosylation sequence insertion experiments. Amino acid positions 
indicated by the symbol “Y” were glycosylated and were positive by endo H assay, whereas those positions tested but found not glycosylated and negative 
by endo H assay are depicted by solid black horizontal lines. The inserted glycosylation acceptor sequence positions precede the letter i. Selected charged 
residues are shown in white characters on a black background. K, lysine residues; R, arginine residues; E, glutamic acid residues. 
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FIG. 4. Multiple sequence alignment (MSA) and percent sequence identity of coronavirus nsp6. The nsp6 amino acid sequences of 18 different 
coronaviruses were obtained from PATRIC (http://patric.vbi.vt.edu/) and aligned using MUSCLE and ClustalW software. The experimentally 
determined TM domains of MHV-A59 nsp6 were used as a reference for alignment. Unshaded boxes indicate the conserved TM domains that 
aligned with other coronavirus nsp6 sequences; the conserved hydrophobic domain (CHD) predicted by all the topology programs is indicated by 
gray shading. The residues of the peptide against which the nsp6 antibody was raised are boxed, with residue designations shown in boldface. 
Putative sites for palmitoylation (cysteine residue[s]) within the GXCXG motif) and phosphorylation (serine 244 and tyrosine 250 in MHV-A59 
nsp6) are indicated. Percent identity (% ID) values are indicated. In MSA, the following notations were used: asterisk indicate invariant amino 
acids, colons indicate highly similar amino acids, and dots indicate similar amino acids. HCoV, human coronavirus; PHEV, porcine hemagglu- 
tinating encephalomyelitis virus; BCoV, bovine coronavirus; BatSARS, bat severe acute respiratory syndrome coronavirus; BatCoV, bat corona- 
virus; SARSCoV, severe acute respiratory syndrome coronavirus; FIPV, feline infectious peritonitis virus; PRCoV, porcine respiratory coronavirus; 
TGEV, transmissible gastroenteritis virus; PEDV, porcine epidemic diarrhea virus; IBV, infectious bronchitis virus. 


165glycV5) were glycosylated, as shown by sensitivity to endo 
H treatment, indicating that the C-terminal end of these chi- 
meric proteins must extend into the ER lumen (Fig. 3B, lanes 
4, 8, and 10). In contrast, the remaining reporter constructs 
were not sensitive to endo H treatment; therefore, the C-terminal 
end of the chimeric constructs must extend into the cytoplasm 
(Fig. 3B, lanes 6, 12, 14, 16, and 18). Thus, these results indicate 
the presence of three luminal loops in nsp6. Identical results were 
obtained when we used PNGaseF (data not shown), which indi- 
cates that the lack of endo H sensitivity was not attributable to the 
protein transiting through the Golgi body, thereby rendering the 
protein insensitive to endo H treatment. 


To further investigate nsp6 topology in detail, we exploited a 
glycosylation sequence insertion mutagenesis approach (7) to cre- 
ate acceptor sequences in the region between amino acids 86 and 
200 of nsp6 by the use of site-directed mutagenesis as described in 
reference 32 in order to independently investigate the topology, 
since bioinformatics predictions of the TM domains within this 
region differ (Fig. 2). Consensus glycosylation acceptor sites 
(NXS) were generated at four sites in the nsp6-V5 plasmid back- 
bone by introducing single-codon insertions as depicted in Fig. 
3C. All the glycosylation insertion constructs were expressed and 
analyzed by use of the endo H assay as described above. As 
expected, the parental nsp6-V5 protein is not glycosylated and did 


AINN NVDIHOIN NYSALSAM Aq SoZ ‘Zt Youve Uo /Bio-wse*IAl//:dyy wo. papeojumoq 


VOL. 83, 2009 


not show a mobility shift after endo H treatment (Fig. 3D, lanes 
1 and 2). In contrast, expression of 99iNsp6-V5 revealed evidence 
of endo H sensitivity (Fig. 3D, lanes 3 and 4), indicating the ER 
luminal localization of the N99 introduced into MHV nsp6. This 
result is in agreement with those obtained with the nsp6-86glycV5 
construct that is also endo H sensitive (Fig. 3B, lanes 7 and 8). 
The insertion of glycosylation acceptor sequences at other sites 
yielded endo H-negative results (lanes 6, 8, and 10), indicating the 
possibility that the introduced NX(S/T) motifs (i) are localized in 
the cytosol, (ii) are localized within the membrane, or (iii) are not 
used, as the glycosylation site is not at least 12 amino acids away 
from the end of the preceding TM and 14 amino acids away from 
the beginning of the following TM (12 + 14 rule), thus rendering 
it inaccessible for glycosylation (1, 7, 30). Our results confirm and 
extend the results of a recent study (22) in which authors were 
unable to resolve whether TM6 or TM7 acted as the final TM 
domain. Our results indicate that TM6 is the final TM domain for 
MHV nsp6. We propose a topology model of MHV-A59 nsp6 in 
Fig. 3E which is in accordance with the distribution of positively 
charged residues (positive inside rule; reviewed in reference 31), 
depicting the higher number of lysine and arginine residues facing 
the cytosolic side of the membrane and the majority of charged 
residues excluded from the TM domain. Taken together, the 
results presented above are consistent with a six-TM domain 
model of MHV nsp6. This report provides new information on 
the membrane topology of nsp6 and provides potential clues with 
respect to the assembly of the coronavirus replication complex. 

To determine whether the experimentally determined six-TM- 
spanning domain topology of MHV-A59 is conserved among 
coronaviruses, we performed MUSCLE (10) and ClustalW (29) 
multiple sequence alignment of nsp6 amino acid sequences rep- 
resenting group 1, group 2, and group 3 coronaviruses obtained 
from PATRIC (http://patric.vbi.vt.edu/) (27). The most striking 
observations were the amino acid sequence conservation in the C 
terminus of all nsp6 proteins and the conservation in the hydro- 
phobicities within the putative TM domains (Fig. 4). This analysis 
revealed several conserved sites that may be important for the 
function of nsp6. We designated the conserved region between 
TM2 and TM3 the “KH loop” because of the invariant lysine and 
histidine residues that are present in the cytosolic loop (Fig. 3E), 
although the function of these amino acids is not yet known. We 
also designated the hydrophobic region in the C-terminal tail the 
“conserved hydrophobic domain” (Fig. 4). We speculate that cys- 
teine residue(s) within the region we designated the “conserved 
G(X)C(X)G motif” may be modified by palmitoylation, indi- 
cating that this region of nsp6 may have important functions in 
establishing protein-protein or protein-membrane interactions 
during the assembly of the viral replication complex. Addition- 
ally, for the MHV-A59 nsp6 protein, the NetPhosK 1.0 server 
(http://www.cbs.dtu.dk/services/NetPhosK/) predicted serine 
and tyrosine residues (serine 244 and tyrosine 250; see Fig. 4) 
at the C-terminal region as sites of possible phosphorylation by 
epidermal growth factor receptor kinase and protein kinase C, 
respectively. Both predicted sites are highly conserved in all 
coronavirus nsp6 proteins (Fig. 4). Overall, our analysis re- 
vealed conserved features in the nsp6 C-terminal region whose 
importance in viral replication can be investigated using a 
coronavirus reverse genetics system. 
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